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^ , Abstract 

We study the randomized fc-server problem on metric spaces consisting of widely 
separated subspaces. We give a method which extends existing algorithms to larger 
spaces with the growth rate of the competitive quotients being at most O(logfc). 
This method yields o(/c)-competitive algorithms solving the randomized /c-server 
problem, for some special underlying metric spaces, e.g. HSTs of "small" height 
(but unbounded degree). HSTs are important tools for probabilistic approximation 
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q | of metric spaces. 
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J£j ; 1 Introduction 

(N 

00 ■ In the theory of designing efficient virtual memory-management algorithms, the well 

studied paging problem plays a central role. Even the earliest operation systems con- 
tained some heuristics to minimize the amount of copying memory pages, which is an 
expensive operation. A generalization of the paging problem, called the fc-server prob- 
lem was introduced by Manasse, McGeoch and Sleator in [H], where the first important 
results were also achieved. The problem can be formulated as follows. Given a metric 
space with k mobile servers that occupy distinct points of the space and a sequence of 
requests (points), each of the requests has to be served, by moving a server from its 
current position to the requested point. The goal is to minimize the total cost, that is 
the sum of the distances covered by the servers; the optimal cost for a given sequence g 
is denoted opt(g). 

An algorithm is online if it serves each request immediately when it arrives (without 
any prior knowledge about the future requests). 

Definition 1. An online algorithm A is c- competitive if for any initial configuration Cq 
and request sequence g it holds that 

cost(A(g)) < c ■ opt(e) + I{C ), 

where I is a non-negative constant depending only on Cq. 
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The competitive ratio of a given online algorithm A is the infimum of the values c 
with A being c-competitive. The fc-server conjecture (see [T4]) states that there exists 
an algorithm A that is ^-competitive for any metric space. Manasse et al. proved that 
k is a lower bound [14], and Koutsoupias and Papadimitriou showed 2k — 1 is an upper 
bound for any metric space [T2] . 

In the randomized online case (sometimes this model is called the oblivious adversary 
model [5]) the competitive ratio can be defined in terms of the expected value as follows: 

Definition 2. A randomized online algorithm R is c-competitive if for any initial con- 
figuration Co and request sequence g we have 

E(cost(R(g))) < c ■ opt(g) + I (Co), 

where I is a non-negative constant depending only on C and E(cost(i?(g))) denotes the 
expected value of cost (R(g)) . 

The competitive ratio of the above randomized algorithm is defined analogously. 

In the randomized version there are more problems that are still open. The random- 
ized /c-server conjecture states that there exists a randomized algorithm with a competi- 
tive ratio 0(log k) in any metric space. The best known lower bound is f2(log kj log 2 log k) 
[3]. A natural upper bound is the bound 2k + 1 given for the deterministic case. By re- 
stricting our attention to metric spaces with a special structure, better bounds can be 
achieved: Fiat et al. showed a lower bound H k = ^2i=o ^ 1 ~ 1°§ ^ f° r uniform metric 
spaces [ID] , which turned out to be also an upper bound, see McGeoch and Sleator, [1 3j . 

In this paper we also consider a restriction of the problem, namely we seek for an 
efficient randomized online algorithm for metric spaces that are "/z-HST spaces" [H E] 
and defined as follows: 

Definition 3. A ^-hierarchically well-separated tree (n-HST) is a metric space defined 
on the leaves of a weighted, rooted tree T with the following properties: 

1. The edge weight from any node to each of its children is the same. 

2. The edge weights along any path from the root to a leaf are decreasing by the factor 
H from one level to the next. The weight of an edge incident to a leaf is one. 

The /^-HST spaces play an important role in the so-called metric space approximation 
technique developed by Bartal [2]. Fakcharoenphol et al [IT] proved that every weighted 
graph on n vertices can be a-probabilistically approximated by a set of /z-HSTs, for an 
arbitrary /i > 1 where a = 0(/ilogn/ log/z). 

It has been shown in [15] that for any 2fc-HST with an underlying tree T that has 
a small depth and maximum degree there exists a polylog(/c)-competitive randomized 
algorithm for the fc-server problem. By slightly modifying the approach of Csaba and 
Lodha [7] and Bartal and Mendel [4] we show that there exists such an algorithm for any 
/z-HST that has a small depth and arbitrary maximum degree t, given /i > minj/c, t}. 

2 Notation 

Suppose the points of a metric space can be partitioned into t blocks, Bi, . . . ,B t , such 
that the diameter of each block is at most S and whenever x and y are points of different 
blocks, their distance is exactly A. Suppose also that A/ 5 = jj, > k holds. The above 
metric space is /i-decomposable [15] . 



For a given request sequence g we denote its ith member by Qi, and the prefix of g 
of length i by g<i. 

Given a block B s , a request sequence g having only requests from B s and a number £ 
of servers inside B s and an algorithm A, let cost(A s (£, g)) denote the cost computed by 
the algorithm A for these inputs and opt s (£, g) (in the latter case also the initial position 
of the servers can be chosen). If g is nonempty, opt s (0, g) is defined to be infinite. 

Definition 4. The demand of the block B s for the request sequence g that contains only 
requests from g is 

D s (g) := min{£ | opt, (4 g) + £A = min{opt s (j, g) + jA}}, 

3 

if g is nonempty, otherwise it is 0. 

Visually, D s (g) denotes the least number of servers to be moved into the initially 
empty block B s to achieve the optimal cost for the sequence g. We note that the behaviour 
of the sequence D s (g 1 ), D s (g< 2 ), ■ ■ ■ , D s (g<i), . . . , D s (g) is unclear. 



3 Algorithm X 

In the rest of the paper we suppose that there exists a randomized online algorithm A 
and a function / with being monotone increasing (constants are allowed), and we 
have 

E[cost(M£, g))} < f{£) ■ opt.(€, g) + (1) 

for any £ and s. Having Algorithm A, we can define our "shell algorithm" X that uses A 
as a subroutine inside the blocks. 



3.1 The Algorithm 

The algorithm uses A as a subroutine and it works in phases. Let g^ denote the sequence 
of the pth phase. In this phase the algorithm works as follows: 

Initially we mark the blocks that contain no servers. 

When gf \ the ith request of this phase arrives to block B s , we compute the 
demand D s (g^l) and the maximal demand 

D;( ff «)=niax{D.(4>)|j<i} 

for this block (note that these values do not change in the other blocks). 

- If D*(gf^) is less than the number of servers in B s at that moment, then 
the request is served by Algorithm A, with respect to the block B s . 

- If D*(g^) becomes equal to the number of servers in B s at that moment, 
then the request is served by Algorithm A, with respect to the block B s and 
we mark the block B s . 

- If D^gf'') is greater than the number of servers in B s at that moment, we 
mark the block B s and perform the following steps until we have D*(g[ p ^) 
servers in that block or we cannot execute the steps (this happens when all 
the blocks become marked): 



• Let us choose an unmarked block B s / randomly uniformly, and a server 
from this block also randomly. We move this chosen server to the block 
B s (such a move is called a jump), either to the requested point, or, 
if there is already a server occupying that point, to a randomly cho- 
sen unoccupied point of B s . If the number of servers in B s > becomes 
D*,(of') via this move, we mark that block. In both B s and B s > we 
restart algorithm A from the current configuration of the block. 

If we cannot raise the number of servers in block B s to D*(g\ pS> ) by repeating 
the above steps (all the blocks became marked), then Phase p + 1 is starting 
and the last request is belonging to this new phase. 

Our main result is the following: 
Theorem 5. Algorithm X is c ■ logfc • / (k) -competitive for some constant c. 

In the following two subsections we will give an upper bound for the cost of Algorithm 
X and several lower bounds for the optimal cost in an arbitrary phase. The above theorem 
easily follows from these. 

For convenience we modify the request sequence g in a way that does not increase the 
optimal cost and does not decrease the cost of any online algorithm, hence the bounds 
we get for this modified sequence will hold also in the general case. The modification 
is defined as follows: we extend the sequence by repeatedly requesting the points of the 
halting configuration of a (fixed) optimal solution. We do this till Y^l=i D*(q<}) becomes 
k. Observe that the optimal cost does not change via this transformation, and any online 
algorithm works the same way in the original part of the sequence (hence online), so the 
cost computed by any online algorithm is at least the original computed cost. 

3.2 Upper bound 

In the first step we prove an auxiliary result. We recall from [8] that an online match- 
ing problem is defined similarly to the online /c-server problem with the following two 
differences: 

1. Each of the servers can moved only once; 

2. The number of the requests is at most k, the number of the servers. 

For any phase p of Algorithm X we can associate the following matching problem MX. 
The underlying metric space of the matching problem is a finite uniform metric space 
that has the blocks B s as points and a distance A between any two different points. Let 
D s (p) denote the number of servers that are in the block B s just at the end of phase p. 
Now in the associated matching problem we have D s (p — 1) servers originally occupying 
the point B s . During phase p, if some value D* increases, we make a number of requests 
in point B s for the associated matching problem: we make the same number of requests 
that the value D* has been increased with. 

We also associate an auxiliary matching algorithm (AMA) on this structure as follows. 
Suppose D* increases at some time, causing jumps. These jumps are corresponding to 
requests of the associated matching problem; AMA satisfies these requests by the servers 
that are corresponding to those involved in these jumps. 



Let D s (p) denote the number of servers in block B s just after phase p. If p is not the 
last phase, let g^' + denote the request sequence we get by adding the first request of 
phase p + 1 to (P> . Now we have 

D* s (q^) < D s (p) < D*s(Q (p)+ ) (2) 

and in all block but at most one we have equalities there (this is the block that causes 
termination of the pth phase). 
Denote 



m p 



:=^max{0,D s (p)-4(p-l)}- (3) 



s=l 



Since the auxiliary metric space is uniform, the optimal cost is Am p . 

Lemma 6 (Csaba, Pluhar, [8]). The expected cost of AMA is at most \ogk ■ Am p . 

Lemma 7. The expected cost of Algorithm X in the pth phase is at most 



f(k) (X>pt a (D a ( e M + ), e t 

+Am p (f(k) log k + f(k) + log k) + A 




log k 



Proof. Consider the pth phase of a run of Algorithm X on the request sequence g and 
let r denote the associated run of AMA. 

Let B s be a block in which some request arrives during this phase. For the sake 
of convenience we omit the index s of the block: let opt(£, g') := opt s (£,g'), D p -\ : = 
D s (p-1) &ndD p :=D s (p). 

Denote g^ the restiction of g to B s . While the block is unmarked, only jump-outs 
can happen from this block; let these jump-outs happen just before the rxth, . . . ,r d -th 
request of g^ p \ respectively. After the block has been marked, only jump-ins can happen; 
let these happen when the r^-+ith,. . . , r^+^+th request arrives, respectively (for any 
given request there can be more preceeding jumps). Denote <Tj = g n . . . g n+1 -\ (where 
g ro is the first and £V d _ +d++1 -i is the last request of the phase in B s ), and let k s ^ := ki 
be the number of servers in B s during a,. Observe that the demand at the rjth request 
is exactly ki. Finally, let L L denote the demand occuring at the r; t — 1th request if this 
request falls into the pth phase; otherwise let £i = 0. 

A jump-in to the block satisfies the last request, hence there is no server movement 
inside the block during a jump. The expected cost of non-jump movements in this block 
(this is called the inner cost) is, applying (Dp), at most 

E[A s (k u a t )\r] < ^ {f(k t )opt(k u a t ) + ' f ^ 6) 
i=0 i=o °^ * 

d~ +d + d~ +d+ , f/i \ 

< f(k) opt(A^) + 5 £ «j£gl. (4) 

We bound the right side of (J4j) piecewise. Summing up till the jump-out just before the 
last: 

d--l <f--l 

^2 opt(&i, at) < ^2 opt(k d - , at) < opt(k d -, g<l d _)- (5) 

i=0 i=Q 



From the last jump-out till the last jump-in: 

d-l d-l 



^ Opt(fci, CTj) < ^ Opt(£ i+ i, (Ti) 
i=d~ i=d~ 
d-l 

= (°pt(^+i> *0 + °pt(4+i. eSj - opt(£j+i, egj) 

< £ (o P t(£ i+1 , g§ i+1 ) - o P t(4 + i, egj) (6) 

i=d~ 
d-l 

i=d~ 

-opt(k t ,gf n )-(k t -e i+1 )A) 

d-i 

i=d~ 

= opt{k d ,g^, d )-opt{k d -,g^ r ) + {k d -k d -)A, (7) 

where d = d~ + d + . Inequality j6j) comes from Definition [H since the demand of Q< ri+1 
is £i + i and the demand of g< n is ki. 

Since k d > D(g^), analogously we get 

opt(& d , a d ) < opt( J D( f? (p) ),a d ) < 

< o P t(L»(^), e W) - o P t(L»(^), ggj < 



< opt( J D( f? (p)+ ), + (D(g {p)+ ) - D(qV>))A 

M \ 

?<r d l 



o V t(k d ,gf r )-{k d -D(g^))A 



= opt(L>(^+), *><">) _ opt ( fcdj ggj + p(f><P>+) - fc d )A. (8) 

Observe that if the request causing termination of phase p, that is the first request of 
phase p + 1, then D(g^) = D(g^ + ) holds. 

Summing up the right hand sides of (jSJ), (J3) and (jHJ) we get 

d-+d+ 

£ optfo, a,) < opt( J D(^ + ), e W) + (D(g^ + ) - k d -)A, (9) 

and substituting this to the right hand side of pj we get that the expected inner cost in 
B s is at most 

f(k) (opt(L>(^)+), g W) + _ fc,_) A ) + £ Vl^- ( 10 ) 

i=o logfci 

On the other hand, 

D(g^ )+ ) - k d - = (D(g® + ) - D p ) + (D p - k d -), (11) 

where we know that D(g^ + ) = and (D p — k d -) is the number of jump-ins 

into this block. 



We can bound the sum of the expressions of the form ^{J^^ fi as follows: 

s=l i=0 to s=l i=0 to to 

where \t\ is the number of the jumps during the phase. This comes from the fact that 
we have only k servers, hence the sum of k s (the number of servers in the target block of 
the jump) and k s i (where the given server jumps from) is still at most k. We also remark 
that 

|r| < k. (13) 

Now we bound the cost of the jumps. Let T be the set of the potential jumps, and the 
length of the jump sequence r\. Applying Proposition [6] we get that 

E[r]A] = ^2 P{r)\r\ A < log k ■ Am p (14) 

Summing up the results ( TTOl) , ( TTTI ). (fT2l) and ( fT4l ) for all the blocks we get the following 
bound for the expected cost of Algorithm X: 

t dj+df 



E E[A a (ki,(Ti)+V^] (15) 

s=l i=0 

(t dj+d+ \ 
P (^)E E HMK^lr] )+AE[ V } (16) 

t 

< ^P(r)/(fc)^o P t( J D s (^) + ),^) + )+ (17) 

t£T s=1 

t 

+ nr)f{k) J2(D(Q {P)+ ) - D p ) A + (18) 



T£T s=l 



E p ( r )/( fc )i r i A + E p ( r )(M + + ae w ( 19 ) 



log/c 

< /(*) ($^opt s ( J D s (^ + ),^) + A^ J D s (^ + )-A;A + logA;-Am ? 



,s=l s=l 

+A -m„ log fc + + m p log fc 

\log/e log/e 

if we apply J^rer P ( r ) = 1 in (P5 and fl32]), and M < A in (P2J). □ 



3.3 Analyzing the optimal cost 

Consider an optimal solution of the fc-server problem. Let C*(g) be the maximal number 
of servers in B s of this optimal solution during phase p and C s (g) be the number of 
servers in B s of the optimal solution at the end p. If we duplicate the first request of 
every phase but the very first one, and consider g^ + instead of the optimal cost 
does not change. 
Denote 

t 

ma;:=Y,(C:(g {p)+ )-C s (g^+)). 

s=l 



From the definitions above it is clear that the optimal cost is at least 

t 

opt(0 w+ ) > ^opt s (C;( f? ^ + ), + A ■ ma* p . (20) 

s=l 

Lemma 8. 

t / i 

opt(A;, g^+) > ^opt( J D s (^+), £^ + ) + A ^D fl (^+) - fc 

s=l \s=l 

Proof. From Definition [4] we have 

^ (opt s (C;(^ + ), e <"> + ) + A(C* s (g® + ) - C s (g^ + ))) > 

8=1 
8=1 

Since J2l=i C s (g^+) = k, the statement follows hv ([20]) . □ 
Lemma 9. The optimal cost is at least 

-A m n . 

P>1 

Proof. Let gfo)* be the subsequence of g^ which we get by omitting each request that 
arrives to a block B s after the demand of that given block reaches -D*(£>^ + ) (note that 
g(p)* j s no t neccessarily a prefix of g^). Now we have two cases: first, if D*(p^ + ) > 
C s (g( p -^ + ) holds, then by Definition!] 

op t,(c;(^ + ), + A(cj( e w+) - c s (£ (p - 1)+ )) 

> op t s (c;(^ + ), + a(<%m+) - c s (^- 1)+ )) 

> (opt s (L>:(^) + ), + A (0, J D:(^ p ) + ) - C s (^^ 1 )+))) 

> Am^O,^!^) - C s (^ 1)+ ))}- 
Otherwise it holds that 

max{0,D s (g^)-C s (g^ + ))} = 0, 

and also obviously 

op t s (c;(^ + ), ^ + ) + a(c;(^) + ) - c s (^ + )) > o. 

From both cases we get 

qpt.(<% W+ ), g(p)+) + A(C:(^)+) - > (21) 

Amax{0, J D,(^))-C s (^ 1 )+))}. 
t t 

Now since £ £> s (£ (p) ) = £ C s (^ 1)+ ) = fc, it also holds that 

8=1 S=l 
t 

^Amax{0,4(e W )-C.(^ 1 J + ))} = 

^iA|4(ff W )-C.(^ 1 )+))|. (22) 



s=l 



2 

s=l 



Summing the cost of the jumps that the optimal solution performs we get 

t 

AY,\C s (q {p)+ )-C s (q^ + )\ <2-opt(A;,^) + ). (23) 

s=l 

Note that the factor of 2 comes from the fact that each jump appears twice on the left 
hand side. Now summing up (l2~TT) . f l22l and f j23~l) we have 

4 • opt{k, g ip)+ ) 
t 

> A {\Ds{Q [p) ) - C s {d p - 1)+ ))\ + \Cs(g {p)+ ) - C s (g^ + )\) 

s=l 

t 

> Aj2\Ds(Q (p) ) - C s (Q ip)+ ))\. (24) 

s=l 

Now summing ( l2Ti ) relativized to phase p and (1241) relativized to phase p-1 we get that 

2 • opt(A;, g^+) + 4 • opt(A;, g^ + ) 
t 

> A {\Ds(Q {p) ) - C s ( e ^ + ))\ + \D s (g^) - C s (g^ + ))\ 

s=l 

t 

> Aj2\Ds(Q (p) )-D s (Q ip ' 1) ))\ = ^rn p , (25) 

s=l 

and the statement follows. □ 
Lemma 10. If the pth phase is not the last one, then opt(fc, g {p)+ ) > A. 
Proof. Similarly to the proof of Lemma [9j 

opt s (C s *(^) + ), + A(C* s (gW+) - C s (g^+)) 

> o P t,(Q(^ + ), + a(c;(^ + ) - c s (^ p - 1 ) + )) 

> (<*rt,(I%M+), + A (0, D* s (g^ + ) - C s (g^+))) 

> Amax{0,D a ( e <*>)-C 8 (Q<*-Q + ))}, 

so we get that the optimal cost is at least 
t 

£ {opt s (D:(g^ + ), + A( j D:(^^ + ) - C s (^" 1)+ ))). (26) 

s=l 

Now since J^ s=1 = fc, moreover if the pth phase is not the last one, then 

Z/3=i D*(g^ + ) > k also holds; applying these we get the statement. □ 

3.4 Proof of Theorem [5] 

Now we are able to prove the theorem about competitiveness of Algorithm X. 

Proof. [Theorem [5] If we apply ( ITBT ) to the first phase and write k instead of m p log£;, 

then Lemmas [3, El [9] and [TO] give that 

E(cost{K A{g))) < /(fc)opt(fc, g) + 

6 • opt(fc, g) (f(k) log k + f(k) + log k) + 

/(*) 



A 



log k 

f'(k) ■ kA 



< f'(k)-opt(k,g) + 



log A; 



where f'(k) = f(k)(6\ogk + 8). 

At this point we make a few remarks: 



□ 



1. If the starting configuration of the online problem coincides with the starting con- 
figuration of an optimal solution, then it holds that E(KA(g)) < c ■ f(k) ■ log A; • 
opt(/c, q). 

2. It is a bit more natural to require A > 5M to hold, where M is the size of the 
greatest block. If additionally M < k holds, we get a better competitive ratio. 

3.5 Corollaries 

Starting from the MARKING algorithm [10] and iterating Theorem[5]we get the following 
result: 

Corollary 11. There exists a (ci log k) h -competitve randomized online algorithm on 
any ji-HST of height h (here /J, > k), where C\ is a constant. Consequently, when 
h < -j rf^-j — r, this alqorithm is o(k) -competitive. 

log ci +log log k ' 3 \ j f 

Considering only metric spaces having at most t < k blocks, it is enough to require 
/i > t. Substituting this to Lemma[7] we get a competitive ratio of c • logfe ■ /(A;), what 
(applying the above corollary) gives us the following result: 

Corollary 12. Suppose the metric space is a [i-HST having only degrees of at most b. 
Then there exists a c\-competive randomized online algorithm, where h is the depth of 
the tree and c<i = 02(b) is some constant. 

4 Further questions 

In the field of online optimization the concept of buying extra resources is also investi- 
gated [9j. The quantity min£{opt s (£, g) +£A} can be seen as the optimal cost of a model 
where one has to buy the servers, for a cost of A each. This problem on uniform spaces 
was studied in [9j. In this case D s (g) is the number of servers bought in an optimal so- 
lution. Now considering to sequence ft, the behaviour of the associated sequence D s (gi) 
is unclear at the moment. It is an interesting question whether the above sequence is 
monotonically increasing, or does it hold that \D s (g)i — D s (g) i+ i\ < 1 for each i. The 
presented proofs would substantially simplify in both cases. 

Another interesting question is that whether the log k factor in the competitive ratio 
per level of the HST is unavoidable, or an overall competitive ratio of 0(log k) holds for 
any HST. 
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